Enhancing Survivability with Proactive Fault-Containment
Abstract
Realistic survivable systems must assume that faults will occur within the system. When a malicious fault is activated, it may work to cause damage and to spread; until the system has recovered from this damage, it will have a lower degree of survivability than it did before the fault occurred. By proactively containing faults that would otherwise spread throughout the system, we can reduce the amount of potential damage to the system, and thereby maintain system survivability. Enabling proactive survivability carries with it a number of challenges, including the need to quantify survivability in order to justify the potential overhead of the proactive mechanisms, the need to select appropriate fault detection strategies, and the need to address runtime problems like deciding when and where to focus proactive effort.
Similar resources
Metrics for the Evaluation of Proactive and Reactive Survivability
Current Byzantine-fault-tolerant survivable systems [5, 6] rely on strong theoretical properties to guarantee survivability. Evaluations of such systems generally focus on the performance overhead of the mechanisms in the fault-free case: a metric that, in itself, is not a good evaluator of survivability. This dearth of metrics makes the objective comparison of the survivability of different im...
Proactive Containment of Malice in Survivable Distributed Systems
The uncontrolled propagation of faults due to malicious intrusion can severely decrease system performance and survivability. Our goal is to employ available information about known or suspected faults in order to provide collusion-avoidance and epidemic-avoidance. We proactively make use of knowledge of faults to notify potentially damaged areas of the system, in order to contain the tainted pa...
Survivability Enhancing Techniques for RFID Systems
Radio Frequency Identification (RFID) has been applied in various high security and high integrity settings. As an important ubiquitous technique, RFID offers opportunities for real-time item tracking, object identification, and inventory management. However, due to the high distribution and vulnerability of its components, an RFID system is subject to various threats which could affect the sys...
Design Patterns for Fault Containment
Fault containment is an important constituent of fault tolerance. Means for fault containment allow a system to limit the impact of manifested faults to some predefined system boundaries. This document presents some of the best known techniques for fault containment formatted as design patterns. These patterns are elicited from the areas of self-stabilization, specification closure and fault to...
A Framework For Proactive Fault Tolerance
Fault tolerance is a major concern to guarantee availability of critical services as well as application execution. Traditional approaches for fault tolerance include checkpoint/restart or duplication. However, it is also possible to anticipate failures and proactively take action before failures occur in order to minimize failure impact on the system and application execution. This document pre...